Outlier-based Data Association: Combining OLAP and Data Mining

نویسندگان

  • Song Lin
  • Donald E. Brown
چکیده

Both data mining and OLAP are powerful decision support tools. However, people use them separately for years: OLAP systems concentrate on the efficiency of building OLAP cubes, and no statistical / data mining algorithms have been applied; on the other hand, statistical analysis are traditionally developed for two-way relational databases, and have not been generalized to the multi-dimensional OLAP data structure. Combining both OLAP and data mining may provide excellent solutions, and in this paper, we present such an example – an OLAP-outlier-based data association method. This method integrates both outlier detection concept in data mining and ideas from OLAP field. An outlier score function is defined over OLAP cells to measure the extremeness level of the cell, and when the outlier score is significant enough, we say the records contained in the cell are associated to each other. We apply our method to a real-world problem: linking criminal incidents, and compare our method with a similarity-based association algorithm. Result shows that this combination of OLAP and data mining provides a novel solution to the problem. Keyword: OLAP, data mining, data association, outlier

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a Course Recommender by Combining Clustering and Fuzzy Association Rules

Each semester, students go through the process of selecting appropriate courses. It is difficult to find information about each course and ultimately make decisions. The objective of this paper is to design a course recommender model which takes student characteristics into account to recommend appropriate courses. The model uses clustering to identify students with similar interests and skills...

متن کامل

An Online Environment for Mining Association Rules in Multidimensional Data

Data warehouses and OLAP (online analytical processing) provide tools to explore and navigate through data cubes in order to extract interesting information under different perspectives and levels of granularity. Nevertheless, OLAP techniques do not allow the identification of relationships, groupings, or exceptions that could hold in a data cube. To that end, we propose to enrich OLAP techniqu...

متن کامل

An Outlier-based Data Association Method for Linking Criminal Incidents

Data association is an important data-mining task and it has various applications. In crime analysis, data association means to link criminal incidents committed by the same person. It helps to discover crime patterns and catch the criminal. In this paper, we present an outlier-based data association method. An outlier score function is defined to measure the extremeness of an observation, and ...

متن کامل

OLAP Mining: Integration of OLAP with Data Mining

OLAP mining is a mechanism which integrates on-line analytical processing (OLAP) with data mining so that mining can be performed in diierent portions of databases or data warehouses and at diierent levels of abstraction at user's nger tips. With rapid developments of data warehouse and OLAP technologies in database industry, it is promising to develop OLAP mining mechanisms. With our years of ...

متن کامل

OLAP on Complex Data: Visualization Operator Based on Correspondence Analysis

Data warehouses and Online Analysis Processing (OLAP) have acknowledged and efficient solutions for helping in the decisionmaking process. Through OLAP operators, online analysis enables the decision-maker to navigate and view data represented in a multi-dimensional manner. But when the data or objects to be analyzed are complex, it is necessary to redefine and enhance the abilities of the OLAP...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002